electrical and electronics engineer inc
A Comprehensive Dataset for Underground Miner Detection in Diverse Scenario
Addy, Cyrus, Gurumadaiah, Ajay Kumar, Gao, Yixiang, Awuah-Offei, Kwame
Underground mining operations face significant safety challenges that make emergency response capabilities crucial. While robots have shown promise in assisting with search and rescue operations, their effectiveness depends on reliable miner detection capabilities. Deep learning algorithms offer potential solutions for automated miner detection, but require comprehensive training datasets, which are currently lacking for underground mining environments. This paper presents a novel thermal imaging dataset specifically designed to enable the development and validation of miner detection systems for potential emergency applications. We systematically captured thermal imagery of various mining activities and scenarios to create a robust foundation for detection algorithms. To establish baseline performance metrics, we evaluated several state-of-the-art object detection algorithms including YOLOv8, YOLOv10, YOLO11, and RT-DETR on our dataset. While not exhaustive of all possible emergency situations, this dataset serves as a crucial first step toward developing reliable thermal-based miner detection systems that could eventually be deployed in real emergency scenarios. This work demonstrates the feasibility of using thermal imaging for miner detection and establishes a foundation for future research in this critical safety application.
- North America > United States > New Jersey > Mercer County > Princeton (0.04)
- North America > United States > Missouri > Phelps County > Rolla (0.04)
- Europe > Switzerland > Basel-City > Basel (0.04)
- (2 more...)
An Integrated Framework for Contextual Personalized LLM-Based Food Recommendation
Personalized food recommendation systems (Food-RecSys) critically underperform due to fragmented component understanding and the failure of conventional machine learning with vast, imbalanced food data. While Large Language Models (LLMs) offer promise, current generic Recommendation as Language Processing (RLP) strategies lack the necessary specialization for the food domain's complexity. This thesis tackles these deficiencies by first identifying and analyzing the essential components for effective Food-RecSys. We introduce two key innovations: a multimedia food logging platform for rich contextual data acquisition and the World Food Atlas, enabling unique geolocation-based food analysis previously unavailable. Building on this foundation, we pioneer the Food Recommendation as Language Processing (F-RLP) framework - a novel, integrated approach specifically architected for the food domain. F-RLP leverages LLMs in a tailored manner, overcoming the limitations of generic models and providing a robust infrastructure for effective, contextual, and truly personalized food recommendations.
- North America > United States > California > Orange County > Irvine (0.28)
- North America > United States > New York > New York County > New York City (0.14)
- North America > United States > Maine (0.04)
- (14 more...)
- Workflow (1.00)
- Research Report > Promising Solution (1.00)
- Research Report > New Finding (1.00)
- (2 more...)
- Leisure & Entertainment (1.00)
- Law (1.00)
- Information Technology > Security & Privacy (1.00)
- (14 more...)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
- (3 more...)
Cal or No Cal? -- Real-Time Miscalibration Detection of LiDAR and Camera Sensors
Tahiraj, Ilir, Swadiryus, Jeremialie, Fent, Felix, Lienkamp, Markus
The goal of extrinsic calibration is the alignment of sensor data to ensure an accurate representation of the surroundings and enable sensor fusion applications. From a safety perspective, sensor calibration is a key enabler of autonomous driving. In the current state of the art, a trend from target-based offline calibration towards targetless online calibration can be observed. However, online calibration is subject to strict real-time and resource constraints which are not met by state-of-the-art methods. This is mainly due to the high number of parameters to estimate, the reliance on geometric features, or the dependence on specific vehicle maneuvers. To meet these requirements and ensure the vehicle's safety at any time, we propose a miscalibration detection framework that shifts the focus from the direct regression of calibration parameters to a binary classification of the calibration state, i.e., calibrated or miscalibrated. Therefore, we propose a contrastive learning approach that compares embedded features in a latent space to classify the calibration state of two different sensor modalities. Moreover, we provide a comprehensive analysis of the feature embeddings and challenging calibration errors that highlight the performance of our approach. As a result, our method outperforms the current state-of-the-art in terms of detection performance, inference time, and resource demand. The code is open source and available on https://github.com/TUMFTM/MiscalibrationDetection.
- Transportation (0.35)
- Information Technology (0.35)
- Automobiles & Trucks (0.35)
- Information Technology > Artificial Intelligence > Vision (1.00)
- Information Technology > Artificial Intelligence > Robots (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)
Experimentation, deployment and monitoring Machine Learning models: Approaches for applying MLOps
Nogare, Diego, Silveira, Ismar Frango
In recent years, especially since 2010, Data Science has proven to be a fundamental discipline and a support tool for the industry to improve its decision-making supported by data. With the increased relevance of this area, the challenges of publishing the developed models into production to deliver the proposed value to end-users have become more prominent To address these challenges, the MLOps discipline has proven to be a promising approach, enabling the automation and governance of the processes of experimenting, publishing and monitoring Machine Learning models. The creation of MLOps pipelines is one of the main strategies to ensure the effectiveness and efficiency of these processes. This work is expected to contribute to the advancement of AI, promoting more efficient and transparent methodologies for end-to-end Machine Learning project development, looking for to answer the investigative question "What are the main challenges faced by companies when publishing Machine Learning models into production, and how can the discipline of MLOps helps overcome them?" and more specific questions like "Why is it necessary to carry out continuous monitoring throughout the entire development lifecycle of machine learning models?" and "What are the essential steps to ensure an automated, efficient, and secure environment for publishing machine learning models?". The remainder of the paper is organised as follow: in section 2 - MLOps pipeline, which explains the concepts and challenges of MLOps pipelines, in section 3 - Application and Case Study, applications and the benefits of implementing a solution with the stages of experimentation, publication and monitoring and three case studies from different fields of the industry that benefited from the implementation of MLOps are presented, and, in section 4 - Conclusion, the views of each of the three major areas explored are exposed.
- North America > Canada > Quebec > Montreal (0.05)
- South America (0.04)
- North America > Central America (0.04)
- Europe > Switzerland (0.04)
An Encoding Framework for Binarized Images using HyperDimensional Computing
Smets, Laura, Van Leekwijck, Werner, Tsang, Ing Jyh, Latré, Steven
Hyperdimensional Computing (HDC) is a brain-inspired and light-weight machine learning method. It has received significant attention in the literature as a candidate to be applied in the wearable internet of things, near-sensor artificial intelligence applications and on-device processing. HDC is computationally less complex than traditional deep learning algorithms and typically achieves moderate to good classification performance. A key aspect that determines the performance of HDC is the encoding of the input data to the hyperdimensional (HD) space. This article proposes a novel light-weight approach relying only on native HD arithmetic vector operations to encode binarized images that preserves similarity of patterns at nearby locations by using point of interest selection and local linear mapping. The method reaches an accuracy of 97.35% on the test set for the MNIST data set and 84.12% for the Fashion-MNIST data set. These results outperform other studies using baseline HDC with different encoding approaches and are on par with more complex hybrid HDC models. The proposed encoding approach also demonstrates a higher robustness to noise and blur compared to the baseline encoding.
- Europe > Belgium > Flanders > Antwerp Province > Antwerp (0.04)
- Europe > Switzerland (0.04)
- Health & Medicine (0.46)
- Information Technology (0.34)
Advancements In Crowd-Monitoring System: A Comprehensive Analysis of Systematic Approaches and Automation Algorithms: State-of-The-Art
Ameen, Mohammed, Stone, Richard
Growing apprehensions surrounding public safety have captured the attention of numerous governments and security agencies across the globe. These entities are increasingly acknowledging the imperative need for reliable and secure crowd-monitoring systems to address these concerns. Effectively managing human gatherings necessitates proactive measures to prevent unforeseen events or complications, ensuring a safe and well-coordinated environment. The scarcity of research focusing on crowd monitoring systems and their security implications has given rise to a burgeoning area of investigation, exploring potential approaches to safeguard human congregations effectively. Crowd monitoring systems depend on a bifurcated approach, encompassing vision-based and non-vision-based technologies. An in-depth analysis of these two methodologies will be conducted in this research. The efficacy of these approaches is contingent upon the specific environment and temporal context in which they are deployed, as they each offer distinct advantages. This paper endeavors to present an in-depth analysis of the recent incorporation of artificial intelligence (AI) algorithms and models into automated systems, emphasizing their contemporary applications and effectiveness in various contexts.
- North America > United States > Iowa (0.04)
- North America > United States > New York (0.04)
- North America > United States > Connecticut (0.04)
- (13 more...)
Audio classification using ML methods
Abstract-- Machine Learning systems have achieved outstanding performance in different domains. In this paper machine learning methods have been applied to classification task to classify music genre. The code shows how to extract features from audio files and classify them using supervised learning into 2 genres namely classical and metal. Machine Learning is used to classify the audio files into 2 genres classical and metal. A total of 20 audio files, 10 for each genre respectively are taken.
Spatial-Temporal Anomaly Detection for Sensor Attacks in Autonomous Vehicles
Higgins, Martin, Jha, Devki, Wallom, David
Time-of-flight (ToF) distance measurement devices such as ultrasonics, LiDAR and radar are widely used in autonomous vehicles for environmental perception, navigation and assisted braking control. Despite their relative importance in making safer driving decisions, these devices are vulnerable to multiple attack types including spoofing, triggering and false data injection. When these attacks are successful they can compromise the security of autonomous vehicles leading to severe consequences for the driver, nearby vehicles and pedestrians. To handle these attacks and protect the measurement devices, we propose a spatial-temporal anomaly detection model \textit{STAnDS} which incorporates a residual error spatial detector, with a time-based expected change detection. This approach is evaluated using a simulated quantitative environment and the results show that \textit{STAnDS} is effective at detecting multiple attack types.
- Information Technology > Security & Privacy (1.00)
- Transportation > Ground > Road (0.47)
ASAP: Adaptive Transmission Scheme for Online Processing of Event-based Algorithms
Tapia, Raul, Dios, José Ramiro Martínez-de, Eguíluz, Augusto Gómez, Ollero, Anibal
Online event-based perception techniques on board robots navigating in complex, unstructured, and dynamic environments can suffer unpredictable changes in the incoming event rates and their processing times, which can cause computational overflow or loss of responsiveness. This paper presents ASAP: a novel event handling framework that dynamically adapts the transmission of events to the processing algorithm, keeping the system responsiveness and preventing overflows. ASAP is composed of two adaptive mechanisms. The first one prevents event processing overflows by discarding an adaptive percentage of the incoming events. The second mechanism dynamically adapts the size of the event packages to reduce the delay between event generation and processing. ASAP has guaranteed convergence and is flexible to the processing algorithm. It has been validated on board a quadrotor and an ornithopter robot in challenging conditions.
- Information Technology > Artificial Intelligence > Vision (1.00)
- Information Technology > Artificial Intelligence > Robots (1.00)
Time Series Prediction using Deep Learning Methods in Healthcare
Morid, Mohammad Amin, Sheng, Olivia R. Liu, Dunbar, Joseph
Traditional machine learning methods face two main challenges in dealing with healthcare predictive analytics tasks. First, the high-dimensional nature of healthcare data needs labor-intensive and time-consuming processes to select an appropriate set of features for each new task. Second, these methods depend on feature engineering to capture the sequential nature of patient data, which may not adequately leverage the temporal patterns of the medical events and their dependencies. Recent deep learning methods have shown promising performance for various healthcare prediction tasks by addressing the high-dimensional and temporal challenges of medical data. These methods can learn useful representations of key factors (e.g., medical concepts or patients) and their interactions from high-dimensional raw or minimally-processed healthcare data. In this paper we systematically reviewed studies focused on advancing and using deep neural networks to leverage patients structured time series data for healthcare prediction tasks. To identify relevant studies, MEDLINE, IEEE, Scopus and ACM digital library were searched for studies published up to February 7th 2021. We found that researchers have contributed to deep time series prediction literature in ten research streams: deep learning models, missing value handling, irregularity handling, patient representation, static data inclusion, attention mechanisms, interpretation, incorporating medical ontologies, learning strategies, and scalability. This study summarizes research insights from these literature streams, identifies several critical research gaps, and suggests future research opportunities for deep learning in patient time series data.
- North America > United States > Utah > Salt Lake County > Salt Lake City (0.04)
- Oceania > Australia (0.04)
- North America > United States > Virginia (0.04)
- (9 more...)
- Research Report > New Finding (1.00)
- Overview (1.00)
- Research Report > Experimental Study (0.93)